An expectation maximization approach for formant tracking using a parameter-free non-linear predictor

نویسندگان

  • Issam Bazzi
  • Alex Acero
  • Li Deng
چکیده

This paper presents a new approach for formant tracking using a parameter-free non-linear predictor that maps formant frequencies and bandwidths into the acoustic feature space. The approach relies on decomposing the speech signal into two components: the first component captures the mapping between formants and acoustic observations, while the second component is intended to capture the residual in the signal. We build the mapping by quantizing the formant space and creating a predictor codebook. Formant tracking is achieved by: 1) EM training of the parameters of the residual component, and 2) searching the predictor codebook for the best formant values. We explore both MAP and MMSE methods for performing formant tracking with the proposed approach. Furthermore, we impose first order continuity constraints on formant trajectories, and use Viterbi search to perform formant tracking. We present formant tracking results on data from the Switchboard corpus.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Formant frequency tracking using Gaussian mixtures with maximum a posteriori adaptation

We present a novel method for estimating formant frequencies by fitting Gaussian mixtures to discrete Fourier Transform (DFT) magnitude spectra. The method first estimates the Gaussian parameters for a sequence of wideband spectra using the Expectation-Maximization (EM) algorithm. It then refines the parameters by using maximum a posteriori (MAP) adaptation. The work was evaluated using manuall...

متن کامل

Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint

A technique for high-accuracy tracking of formants or vocal tract resonances is presented in this paper using a novel nonlinear predictor and using a target-directed temporal constraint. The nonlinear predictor is constructed from a parameter-free, discrete mapping function from the formant (frequencies and bandwidths) space to the LPC-cepstral space, with trainable residuals. We examine in thi...

متن کامل

This is a placeholder. Final title will be filled later

A technique for high-accuracy tracking of formants or vocal tract resonances is presented in this paper using a novel nonlinear predictor and using a target-directed temporal constraint. The nonlinear predictor is constructed from a parameter-free, discrete mapping function from the formant (frequencies and bandwidths) space to the LPC-cepstral space, with trainable residuals. We examine in thi...

متن کامل

Speech enhancement for linear-predictive-analysis-by-synthesis coders

Speech coding techniques commonly used in low bit rate analysis-by-synthesis linear predictive coders (LPAS coders) create a model that emphasizes the important features of a speech signal. The utilization of these coding methods for speech enhancement is shown. Specifically, the speech signal will be modeled as the output of a cascade of an adaptive formant filter and an adaptive pitch filter,...

متن کامل

Position Error Modeling Using Gaussian Mixture Distributions With Application to Comparison of Tracking Algorithms

In this paper Gaussian mixtures are used to model the distribution of position error in tracking algorithms. An expectation maximization algorithm is constructed to estimate parameters of a k-component Gaussian mixture based on a sample set obtained from a tracking simulator. The modeling and parameter estimation approach is applied to position error data generated by several tracking algorithm...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003